arXiv:1504.07617v2 [hep-ph] 30 Oct 2015


Prepared for submission to JHEP 


Heavy Higgs Bosons at 14 TeV and 100 TeV


Jan Hajer,a,b Ying-Ying Li,a Tao Liu,a John F.H. Shiua

a Department of Physics, The Hong Kong University of Science and Technology, 

Clear Water Bay, Kowloon, Hong Kong S.A.R., P.R.C. 

b Jockey Club Institute for Advanced Study, The Hong Kong University of Science and Technology, 
Clear Water Bay, Kowloon, Hong Kong S.A.R., P.R.C. 

E-mail: jan.hajer@ust.hk, ylict@connect.ust.hk, taoliu@ust.hk,
fhshiu@ust.hk

Abstract: Searching for Higgs bosons beyond the Standard Model (BSM) is one of the
most important missions for hadron colliders. As a landmark of BSM physics, the MSSM
Higgs sector at the LHC is expected to be tested up to the scale of the decoupling limit of
$\mathcal{O}(1)$ TeV, except for a wedge region centered around $\tan\beta \sim 3$-$10$, which is
known to be difficult to probe. In this article, we present a dedicated study testing the decoupled
MSSM Higgs sector at the LHC and a next-generation pp-collider, proposing to search in
channels with associated Higgs production, with the neutral and charged Higgs further
decaying into $t\bar{t}$ and $tb$, respectively. In the case of the neutral Higgs we are able to probe
the so far uncovered wedge region via $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$. Additionally, we cover
the high $\tan\beta$ range with $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$. The combination of these searches with
channels dedicated to the low $\tan\beta$ region, such as $pp \to H/A \to t\bar{t}$ and
$pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$, potentially covers the full $\tan\beta$ range. The search for the
charged Higgs has a slightly smaller sensitivity in the moderate $\tan\beta$ region, but additionally
probes the higher and lower $\tan\beta$ regions with even greater sensitivity, via
$pp \to tbH^\pm \to tbtb$. While the LHC will be able to probe the whole $\tan\beta$ range for Higgs
masses of $\mathcal{O}(1)$ TeV by combining these channels, we show that a future 100 TeV pp-collider
has the potential to push the sensitivity reach up to $\sim \mathcal{O}(10)$ TeV. In order to deal with
the novel kinematics of top quarks produced by heavy Higgs decays, the multivariate Boosted
Decision Tree (BDT) method is applied in our collider analyses. The BDT-based tagging
efficiencies of both hadronic and leptonic top-jets, and their mutual fake rates as well as the
faking rates by other jets ($h$, $Z$, $W$, $b$, etc.) are also presented.

1 Introduction

With the discovery of a 125 GeV Higgs boson and the rapid progress made at the Large Hadron
Collider (LHC), it is time for the high-energy physics community to chart a road map for
the next decade or the next few decades. In addition to a high-luminosity LHC program,
two preliminary proposals have been made, the Future hadron-hadron Circular Collider (FCC-hh)
program at CERN [1] and the Super-pp-Collider (SppC) program in China [2],
both of which involve the construction of a 50-100 TeV pp collider [3] (below we will universally
assume a 100 TeV machine). With more accumulated data or a higher expected energy scale,
the high-luminosity LHC and the next-generation pp-colliders offer great opportunities for
the search for physics up to and beyond the TeV scale, respectively, including additional Higgs
bosons. (For recent studies on physics at a next-generation pp-collider see e.g. [4-15].)

An extended Higgs sector appears widely in physics beyond the Standard Model
(BSM), such as supersymmetric theories [16] or composite Higgs models [17], because of
requirements by either symmetry or phenomenology. The additional Higgs fields fill up
singlet, doublet, triplet or some other representations of the electroweak (EW) gauge symmetry,
generically yielding interactions with the SM sector which are characterized by their electric
charges and CP-structures. Searching for these new Higgs bosons, therefore, provides an
unambiguous way to probe for new physics and is one of the top missions of hadron colliders.

Channels
  $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$, $b\bar{b}b\bar{b}$*
  $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$
  $pp \to H/A \to t\bar{t}$*, $pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$*
  $pp \to tbH^\pm \to tbtb$, $tb\tau\nu$

Table 1: Main channels to cover or potentially cover various $\tan\beta$ regions in the decoupling limit
of the MSSM Higgs sector. The channels marked by “*” are not covered in collider analyses in this
article.

At the LHC, both the ATLAS and the CMS collaborations have started their searches 
for neutral, singly-charged or doubly-charged Higgs bosons, in various channels [18-23]. 
Within the next decade, tests up to $\mathcal{O}(1)$ TeV are expected in a general context. Additionally,
the energy scale accessible to the next-generation pp-collider is several times higher than 
that of the LHC, which enables us to search for a much heavier Higgs sector. In both cases, 
some decay modes kinematically suppressed in the low mass domain are fully switched 
on. Concurrently, the kinematics of their decay products can be dramatically changed. 
A systematic study of decoupled or heavy Higgs sectors at hadron colliders is therefore 
essential for both the high-luminosity LHC program and the proposals for next-generation 
pp-colliders. 

This motivates the studies in this article. We focus on searches for neutral and
singly-charged heavy Higgs bosons, and will analyze the sensitivity reach that might be achieved
at both the (HL-)LHC and a 100 TeV pp collider, using the Minimal Supersymmetric
Standard Model (MSSM) for illustration. In the MSSM, there are in total five Higgs bosons: 
three of them are neutral (two CP-even and one CP-odd, or three CP-mixing ones) and 
two of them are charged. Because of the limitation in the energy reach, the LHC searches 
mainly focus on a mass domain below the TeV scale. For constraints on the MSSM Higgs 
sector based on these searches see e.g. [24, 25]. For searches including light gauginos and 
higgsinos see for example [26, 27]. 

For neutral Higgs bosons, $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$ yields the best sensitivity in the
large $\tan\beta$ region ($\tan\beta > 10$), while $pp \to H \to VV, hh, t\bar{t}$ together
with $pp \to A \to hZ, t\bar{t}$ yield or might yield the best sensitivities in the small
$\tan\beta$ region ($\tan\beta < 3$). The moderate $\tan\beta$ region ($\tan\beta \sim 3$-$10$),
in contrast, is difficult to probe, yielding a well-known
untouched “wedge” region (for recent discussions see e.g. [24, 28]). As for charged Higgs
bosons, $pp \to tbH^\pm \to tb\tau\nu$ plays a crucial role in probing the small $\tan\beta$ region with
$m_{H^\pm} < m_t + m_b$ as well as the large $\tan\beta$ region, because the coupling of $H^\pm$ to
$\tau\nu$ is $\tan\beta$-enhanced.

In the decoupling limit, the decays $H/A \to t\bar{t}$ and $H^\pm \to tb$ are fully switched on if
the SUSY sector is decoupled as well, whereas the decays $H/A \to VV, hh$, $A \to hZ$, and
$H^\pm \to hW^\pm$ are generically suppressed [29]. Br($H/A \to t\bar{t}$) becomes sizable for moderate
$\tan\beta$ and dominant for low $\tan\beta$, and Br($H^\pm \to tb$) becomes dominant for the whole
$\tan\beta$ region. However, it is known that the signal in the channel $pp \to H/A \to t\bar{t}$ and the
QCD $t\bar{t}$ background have strong interference effects [30, 31]. The search in this channel is
therefore extremely challenging. Instead we propose in this article to test a heavy Higgs sector
in channels with associated Higgs production, where the interference effects between the
signal and the QCD background are much less severe than in $pp \to H/A \to t\bar{t}$.

Interestingly, the channels $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ and $pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$
have a cross section maximized in the moderate $\tan\beta$ and low $\tan\beta$ regions, respectively.
A combination of the channels $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$ (or $pp \to bH/A \to bbb$ [32]),
$pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ and $pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$ or $pp \to H/A \to t\bar{t}$
(if the interference structure can be efficiently identified) may thus yield full coverage of the
$\tan\beta$ domain in searching for neutral Higgs bosons, provided the moderate and low $\tan\beta$
regions can be probed efficiently in the two latter channels. For the charged Higgs boson
searches, $pp \to tbH^\pm \to tbtb$ is the golden channel for both the high and low $\tan\beta$ domains.

The main channels for the MSSM Higgs searches, which will be explored in this
paper (except the ones marked with “*”), are summarized in Table 1. These channels are
characterized by two classes of kinematics:

1. Kinematics related to the heaviness of the BSM Higgs bosons, such as highly boosted
top quarks in Higgs decay. Looking into the internal structure of the boosted objects or
defining boostness-based variables can efficiently suppress the related backgrounds.

2. Kinematics related to the particles accompanying Higgs production, e.g., the
forwardness/backwardness of the two $b$-jets accompanying the Higgs production in the
channels $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ and $pp \to tbH^\pm \to tbtb$, and the reconstructed top
pair accompanying the Higgs production in $pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$.

Meanwhile, the kinematic features of the particles accompanying Higgs production can
efficiently facilitate revealing the $m_{tt}$ or $m_{tb}$ peak of the signals.

To fully extract the potential of a pp collider, particularly one at 100 TeV, in searching
for the decoupled MSSM Higgs sector, we exploit the kinematic features discussed above by
applying the multivariate Boosted Decision Tree (BDT) method to the channels involving top
quarks. As a demonstration of the effectiveness of the BDT method, we present, in addition
to the sensitivity reach of the pp colliders at 14 and 100 TeV, the tagging efficiencies of
both hadronic and leptonic top-jets, and their mutual fake rates as well as the faking rates
by other jets ($h$, $Z$, $W$, $b$, etc.). The code implementing our analyses, BoCA 0.1, is publicly
available [33]. Although the studies focus on the MSSM Higgs sector for its theoretical
motivation and experimental representativeness, the sensitivity reach can be projected
straightforwardly to some other contexts such as two Higgs doublet models and composite
Higgs models. The strategies developed for these Higgs searches might be applied to
other collider analyses as well.

We organize this article in the following way. We briefly review the MSSM Higgs
sector in the decoupling limit in Section 2, and introduce the strategies for constructing
the BDT-based top-jet taggers (both hadronic and leptonic) and testing the decoupled
MSSM Higgs sector at pp colliders in Section 3. The discovery reaches and exclusion limits
which might be achieved at 14 TeV and 100 TeV are presented in Section 4. We summarize
our studies and point out potential directions for exploration after this work in Section 5.
More details on the collider analyses and the construction of the BDT-based top-taggers
are provided in the Appendices.

Higgs    Coupling               Value
H        $g_{HVV}$              $\cos(\beta - \alpha)$
         $g_{Htt}$              $\sin\alpha / \sin\beta$
         $g_{Hbb}$              $\cos\alpha / \cos\beta$
         $g_{H\tau\tau}$        $\cos\alpha / \cos\beta$
A        $g_{AVV}$              $0$
         $g_{Att}$              $\cot\beta$
         $g_{Abb}$              $\tan\beta$
         $g_{A\tau\tau}$        $\tan\beta$
H±       $g_{H^+ \bar{u} d}$    $\propto V_{ud}\,[m_d \tan\beta\,(1 + \gamma_5) + m_u \cot\beta\,(1 - \gamma_5)]$
         $g_{H^- u \bar{d}}$    $\propto V_{ud}\,[m_d \tan\beta\,(1 - \gamma_5) + m_u \cot\beta\,(1 + \gamma_5)]$
         $g_{H^+ \bar{\nu} \ell}$   $\propto m_\ell \tan\beta\,(1 + \gamma_5)$
         $g_{H^- \nu \bar{\ell}}$   $\propto m_\ell \tan\beta\,(1 - \gamma_5)$

Table 2: Tree-level $H/A$ and $H^\pm$ couplings in the MSSM. Here $V_{ud}$ is the CKM matrix element
present in the case of quarks.

2 The Higgs Sector in the MSSM 

In the MSSM, the Higgs mass spectrum and the Higgs couplings to the SM sector and to
themselves depend, at tree level, on only two free parameters, often chosen to be the
Higgs vacuum expectation value (VEV) alignment $\tan\beta = v_u / v_d$ and the mass of the CP-odd
neutral Higgs boson $m_A$ (or the mass of the charged Higgs boson $m_{H^\pm}$) in the case with no
CP-violation [29]. In comparison, the general type II 2HDM depends on more parameters
at tree level [34, 35]. The additional free parameters include the masses of the three other
Higgs bosons, and the mixing angle $\alpha$ defined within the CP-even neutral sector by

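$$ \begin{pmatrix} h \\ H \end{pmatrix} = \begin{pmatrix} -\sin\alpha & \cos\alpha \\ \cos\alpha & \sin\alpha \end{pmatrix} \begin{pmatrix} H_d^0 \\ H_u^0 \end{pmatrix} , \tag{2.1} $$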

if CP-symmetry is conserved [35]. In the MSSM these parameters are correlated with each
other. For example, the mixing angle $\alpha$ is fixed by $\tan\beta$ and $m_A$ through the relation
(see e.g. [36])

$$ \alpha = \frac{1}{2} \arctan\left( \tan 2\beta \, \frac{m_A^2 + M_Z^2}{m_A^2 - M_Z^2} \right) , \qquad -\frac{\pi}{2} < \alpha < 0 . \tag{2.2} $$
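As an illustration of how Eq. (2.2) drives the couplings towards the decoupling limit, the following minimal sketch evaluates $\alpha$ and $\cos(\beta - \alpha)$ at tree level. It is not part of the analysis code of this article; the function names, the numerical value of $M_Z$ and the restriction to $m_A > M_Z$ are assumptions made for this example.

```python
import math

M_Z = 91.1876  # GeV; assumed input value for the Z boson mass


def mixing_angle_alpha(tan_beta, m_a, m_z=M_Z):
    """Tree-level CP-even mixing angle alpha (radians) from Eq. (2.2).

    Only the regime m_a > m_z is handled in this sketch.
    """
    if m_a <= m_z:
        raise ValueError("this sketch assumes m_A > M_Z")
    beta = math.atan(tan_beta)
    ratio = (m_a**2 + m_z**2) / (m_a**2 - m_z**2)
    alpha = 0.5 * math.atan(math.tan(2.0 * beta) * ratio)
    # atan returns the principal value; move onto the branch -pi/2 < alpha < 0
    if alpha > 0.0:
        alpha -= 0.5 * math.pi
    return alpha


def cos_beta_minus_alpha(tan_beta, m_a, m_z=M_Z):
    """Factor that controls g_HVV, g_hZA and g_hWH in the decoupling limit."""
    beta = math.atan(tan_beta)
    return math.cos(beta - mixing_angle_alpha(tan_beta, m_a, m_z))


if __name__ == "__main__":
    for m_a in (200.0, 500.0, 1000.0, 5000.0):
        print(m_a, cos_beta_minus_alpha(5.0, m_a))
```

For fixed $\tan\beta$ the printed values shrink in magnitude as $m_A$ grows, which is the tree-level statement that $\cos(\beta - \alpha) \to 0$ in the decoupling limit.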

In addition to coupling to the SM sector and to themselves, the MSSM Higgs bosons can couple
directly to superparticles, which may significantly alter their production and decays.

Figure 1: Contours for the branching ratios of the CP-even neutral Higgs to top pairs (a), the CP-odd
neutral Higgs to top pairs (b) and the charged Higgs to a top-bottom pair (c) in the MSSM.


Instead of giving a full phenomenological consideration, we assume a decoupled SUSY
sector [37]. We neglect supersymmetric corrections to conventional Higgs production and
decays, tolerate a potential deviation of the SM-like Higgs mass from the observed value,
and turn off the Higgs decays into superparticles. Such a treatment makes the analyses
less model-dependent, and enables us to project the sensitivity reach onto a specific model more
easily.

In the decoupling limit, the couplings take the values of the 2HDM alignment limit 

$$ g_{HVV} = g_{hZA} = g_{hW^\mp H^\pm} \propto \cos(\beta - \alpha) \to 0 , \tag{2.3} $$

yielding a generic suppression in sensitivities for the search modes (see Table 2): $H \to VV$,
$A \to hZ$, and $H^\pm \to hW^\pm$. As for $H \to hh$, though the involved coupling $g_{Hhh}$ is not
completely suppressed in the alignment limit, its decay width is inversely proportional to
$m_H$, yielding a suppressed sensitivity for this mode in the case of large $m_H$ (e.g., see [24]).
In the decoupling limit, therefore, $H/A$ mainly decay into $b\bar{b}$, $\tau\tau$ for large and intermediate
$\tan\beta$ and into $t\bar{t}$ for intermediate and low $\tan\beta$. $H^\pm$ overwhelmingly decays into $tb$ except
in the parameter region with relatively small $m_{H^\pm}$ and large $\tan\beta$, where the branching
ratio of $H^\pm$ into $\tau\nu$ is not negligibly small. The total decay width of these Higgs bosons,
scaled by their mass, is at the percent level or even below.

The contours for the branching fractions of $H/A \to t\bar{t}$ and $H^\pm \to tb$ are shown in Figure 1.
The figure indicates a clear $\tan\beta$ suppression of these branching fractions. Compared to
Br($H/A \to t\bar{t}$), however, Br($H^\pm \to tb$) is much less suppressed in the moderate and large
$\tan\beta$ regions. This is mainly because the coupling $g_{H^\pm tb}$ is at tree level a linear
combination of the $t$ and $b$ Yukawa couplings, with the latter yielding a $\tan\beta$-enhanced
contribution to the $H^\pm$ decay width. In spite of this, Br($H^\pm \to tb$) is relatively small in the
upper-left corner, due to additional phase-space suppression.
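A short worked estimate, added here as an illustration rather than taken from the analysis, shows where the two Yukawa contributions balance. It assumes the tree-level structure of the $H^\pm$ quark couplings listed in Table 2 and neglects phase-space factors and the mixed mass term, so that the squared $H^\pm tb$ coupling scales as $m_t^2 \cot^2\beta + m_b^2 \tan^2\beta$. The quark masses inserted in the last line are illustrative inputs, not values quoted in this article.

```latex
\begin{align}
f(\tan\beta) &= m_t^2 \cot^2\beta + m_b^2 \tan^2\beta , \\
\frac{\mathrm{d}f}{\mathrm{d}\tan\beta}
  &= -\frac{2 m_t^2}{\tan^3\beta} + 2 m_b^2 \tan\beta = 0
  \;\Longrightarrow\; \tan\beta_{\min} = \sqrt{\frac{m_t}{m_b}} , \\
\tan\beta_{\min} &\approx \sqrt{\frac{173~\text{GeV}}{4.2~\text{GeV}}} \approx 6.4 .
\end{align}
```

Under these assumptions the coupling is smallest at moderate $\tan\beta$ and grows towards both small and large $\tan\beta$.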

For similar reasons, the production of these BSM Higgs bosons via vector boson
fusion or Higgs Strahlung is highly suppressed. Gluon fusion is the most important
production mechanism of $H/A$ in the low $\tan\beta$ region, and $b\bar{b}$ associated production becomes
dominant in the moderate and large $\tan\beta$ regions (see Figure 2). The mild enhancement
of $\sigma(pp \to H/A)$ in the large $\tan\beta$ region is mainly caused by the contribution from the $b$
quark loop. Naturally we expect that $pp \to H/A \to t\bar{t}$ and $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$, $b\bar{b}b\bar{b}$
yield the best sensitivity in the low and large $\tan\beta$ regions, respectively. One subtlety, however,
arises from the interference between the signal and the QCD background. It has been known
for a while that the peak of the invariant mass $m_{tt}$ distribution for $pp \to H/A \to t\bar{t}$ can
be distorted into a rather complicated peak-dip structure or even be smoothed away as $m_A$
increases [30, 31]. This potentially disables any resonance-reconstruction-based analysis,
and necessitates the development of new strategies. Such an exploration is beyond the
scope of this paper. Instead, we propose to search in channels with $b\bar{b}$, $t\bar{t}$ associated Higgs
production, where the interference effects are less severe.

Figure 2: Contours for the production cross-sections of neutral Higgs (a), neutral Higgs in association
with two bottom quarks (b) and charged Higgs in association with a bottom and a top quark (c) in
the MSSM, at a 100 TeV pp-collider.



Figure 3: Contours for the cross-sections of neutral Higgs decaying to a top pair (a), neutral Higgs 
produced in association with two bottom quarks and decaying to a top pair (b) and charged Higgs 
produced in association with a bottom and a top quark decaying to a bottom and a top quark (c) in 
the MSSM, at a 100 TeV pp-collider. 

Figure 4: Contours for the cross-sections of neutral Higgs decaying to a top pair (a), neutral Higgs
produced in association with two bottom quarks and decaying to a top pair (b) and charged Higgs
produced in association with a bottom and a top quark decaying to a bottom and a top quark (c) in
the MSSM, at the LHC.


A remarkable observation, however, is that the moderate $\tan\beta$ region can be efficiently
probed via the channel $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$, compared to the large and small $\tan\beta$ regions.
Explicitly, we present the contours for $\sigma(pp \to b\bar{b}H/A)\,\mathrm{Br}(H/A \to t\bar{t})$ at a 100 TeV pp
collider in Figure 3. For reference, we also present the contours of
$\sigma(pp \to H/A)\,\mathrm{Br}(H/A \to t\bar{t})$, with the interference effects neglected. Indeed,
$\sigma(pp \to b\bar{b}H/A)\,\mathrm{Br}(H/A \to t\bar{t})$ is maximized in the moderate $\tan\beta$ region. Though its
largest value for $m_A \sim 1$ TeV is one to two orders of magnitude smaller than that of
$\sigma(pp \to H/A)\,\mathrm{Br}(H/A \to t\bar{t})$, they become comparable as $m_A$ increases
towards $\sim 10$ TeV. Given that the channel $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ carries more kinematic
features due to the two additional $b$ quarks, we may well expect that this channel can yield
a high sensitivity in probing the moderate $\tan\beta$ region. This is true, as will be illustrated
below. A combination of these channels leaves no “wedge” open around moderate $\tan\beta$,
given an $m_A$ within the energy reach of the LHC or the 100 TeV pp collider, if the low $\tan\beta$
region can be efficiently probed.

As for $H^\pm$ production, it is dominated by $pp \to tbH^\pm$ in both the large and small $\tan\beta$ regions
(see Figure 2). This is simply because the coupling $g_{H^\pm tb}$ receives both $\tan\beta$-enhanced
and $\tan\beta$-suppressed contributions at tree level. Given that $H^\pm \to tb$ is the main decay
mode in the decoupling limit, the channel $pp \to tbH^\pm \to tbtb$ is expected to be the golden
channel for the heavy $H^\pm$ searches. The contours for $\sigma(pp \to btH^\pm)\,\mathrm{Br}(H^\pm \to tb)$ at a
100 TeV pp collider are also presented in Figure 3. Because of the effects discussed above,
if $m_A = m_{H^\pm}$, the largest $\sigma(pp \to btH^\pm)\,\mathrm{Br}(H^\pm \to tb)$ is typically larger than the largest
$\sigma(pp \to b\bar{b}H/A)\,\mathrm{Br}(H/A \to t\bar{t})$ and $\sigma(pp \to H/A)\,\mathrm{Br}(H/A \to t\bar{t})$. For example, the 10 ab
contour reaches $m_{H^\pm} \sim 20$ TeV for $H^\pm$, in comparison to $m_A \sim 10$ TeV.
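To give a feeling for what a 10 ab contour means in terms of raw event counts, the sketch below multiplies a cross-section times branching ratio by an integrated luminosity. The function name and the optional efficiency factor are assumptions of this example; the 3 ab$^{-1}$ and 30 ab$^{-1}$ values are the 100 TeV luminosities considered in Section 4, and no selection efficiency from this article is applied.

```python
def expected_events(sigma_times_br_ab, luminosity_inv_ab, efficiency=1.0):
    """Expected number of events N = sigma x Br x L x efficiency.

    sigma_times_br_ab : cross-section times branching ratio in attobarn (ab)
    luminosity_inv_ab : integrated luminosity in inverse attobarn (ab^-1)
    efficiency        : overall selection efficiency (1.0 means no cuts applied)
    """
    return sigma_times_br_ab * luminosity_inv_ab * efficiency


if __name__ == "__main__":
    for lumi in (3.0, 30.0):
        print(lumi, expected_events(10.0, lumi))
```

A 10 ab signal thus corresponds to 30 events at 3 ab$^{-1}$ and 300 events at 30 ab$^{-1}$ before any acceptance or tagging efficiency is taken into account.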

The contours for $\sigma(pp \to b\bar{b}H/A)\,\mathrm{Br}(H/A \to t\bar{t})$ and $\sigma(pp \to H/A)\,\mathrm{Br}(H/A \to t\bar{t})$ and
for $\sigma(pp \to btH^\pm)\,\mathrm{Br}(H^\pm \to tb)$ in the MSSM at the LHC are presented in Figure 4. As
a comparison, the 10 ab contours recede to $m_A \sim 2$ TeV and $m_{H^\pm} \sim 4$ TeV, respectively.
These contours can help to understand the LHC and HL-LHC sensitivities in this regard, which
will be discussed below.

Figure 5: Transverse momentum distribution of the hardest top (a) and the hardest lepton (b)
for background ($t\bar{t}b\bar{b}$) and signal ($b\bar{b}H/A$, $tbH^\pm$). Additionally, we show the opening angle
between the lepton and the $b$-quark coming from the decay of the hardest top (c). The vertical line at
$\Delta R = 0.5$ represents the jet radius we have chosen. The areas under all curves are normalized.

All cross-sections are calculated to leading order with MadGraph 5.2.1.2 [38], in the
case of signal cross-sections without applying any pre-cuts. Additionally, these results are
cross-checked with SusHi 1.4.1 [39]. Identical calculations for $\sqrt{s} = 8$ TeV match previous
results for neutral [40-42] and charged Higgs [42-45]. Our results for the branching ratios
are calculated with HDECAY [46]. For our LHC analysis we apply a k-factor of 1.21 to the
background [47].

3 Strategies for Collider Analyses 
3.1 Collider Analyses and BDT 

In this article we mainly analyze the discovery reaches and exclusion limits of searches for
both neutral and charged Higgs bosons at a 100 TeV pp-collider. We cover the channels
summarized in Table 1, with two exceptions: the channel $pp \to b\bar{b}H/A \to b\bar{b}b\bar{b}$, whose coverage
of parameter space largely overlaps with that of the channel $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$, and the
channels $pp \to H/A \to t\bar{t}$ and $pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$, which are potentially sensitive in probing
the low $\tan\beta$ region. The corresponding backgrounds are summarized in Table 3. Because
of their significance, we also present the $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ and $pp \to tbH^\pm \to tbtb$
sensitivities that might be achieved at the LHC.

The analyses are largely based on two classes of kinematic features. The first one is
related to the heaviness of the Higgs bosons in the decoupling limit. In such a case, the decay
products of the Higgs are strongly boosted, resulting e.g. for $H/A \to t\bar{t}$ in two boosted tops
and for $H^\pm \to tb$ in one boosted top. In the analyses with intermediate top quarks, we
assume that the more boosted one decays leptonically and optimize the analyses based
on this assumption. The transverse momentum distributions of the hardest top quark and the
hardest lepton are depicted in Figure 5a and 5b, respectively. The decay products of these tops
tend to lie within a single jet cone (cf. Figure 5c), yielding a top jet. Additionally, the events have a
large sum of scalar transverse momenta ($H_T$). These features become less prominent for
smaller Higgs masses, especially for the mass range accessible to the LHC.

(a) $b$-quarks accompanying Higgs production
(b) $\Delta\eta$ between the two accompanying $b$-quarks

Figure 6: Rapidity distributions of the $b$-quarks accompanying Higgs production (a) and the
distribution of the difference in rapidity between these two $b$-quarks (b). The graphs show background
($t\bar{t}b\bar{b}$) and signal ($b\bar{b}H/A$, $tbH^\pm$) for 14 TeV and 100 TeV. We choose the $b$-quarks accompanying
the top production in $t\bar{t}b\bar{b}$ for comparison. For 14 TeV and 100 TeV a cut on the jet transverse
momentum of 20 GeV and 40 GeV has been applied, respectively. The vertical line at $|\eta| = 3.5$ in (a)
represents the tracker coverage assumed for a 100 TeV pp-collider; for jets with larger rapidity
$b$-tagging is not applicable. The areas under all curves are normalized.
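The collimation of boosted top decays can be estimated with a common rule of thumb, $\Delta R \approx 2m/p_T$ for the angular spread of the decay products of a particle of mass $m$. This rule is an approximation that is not stated in the article, and the top mass value and function names below are assumptions of this sketch; the jet radius of 0.5 is the one used in the analysis.

```python
TOP_MASS = 173.0   # GeV; assumed illustrative value
JET_RADIUS = 0.5   # jet radius chosen in the analysis


def decay_cone_size(mass, pt):
    """Approximate angular spread Delta R ~ 2 m / pT of the decay products."""
    return 2.0 * mass / pt


def min_pt_for_single_jet(mass, jet_radius):
    """Transverse momentum above which Delta R ~ 2 m / pT fits inside one jet."""
    return 2.0 * mass / jet_radius


def scalar_ht(transverse_momenta):
    """Scalar sum of transverse momenta (H_T) of the selected objects."""
    return sum(transverse_momenta)


if __name__ == "__main__":
    print(min_pt_for_single_jet(TOP_MASS, JET_RADIUS))  # about 692 GeV
    print(decay_cone_size(TOP_MASS, 1500.0))
```

With these inputs a top quark needs a transverse momentum of roughly 700 GeV before its decay products are expected to fall within a single jet of radius 0.5, which is why this feature fades for Higgs masses accessible to the LHC.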

Another class of kinematic features is related to the particles accompanying the Higgs
production. For example, in the case of $b\bar{b}H/A$, the two $b$-jets accompanying the Higgs are
less boosted, but tend to have a large rapidity. Hence, the difference in rapidity ($\Delta\eta$) is
typically larger than that of $b$-jet pairs in the main backgrounds $t\bar{t}b\bar{b}$ and $t\bar{t}c\bar{c}$. Therefore,
$\Delta\eta$ can serve as a strong discriminator if the two accompanying $b$-jets are identified. The
story is similar in the case of $tbH^\pm$, where the second $b$-jet is from the top decay. However,
the rapidity of many of these $b$-jets is larger than the coverage of the tracker of an LHC-like
detector design ($|\eta| < 2.5$), as is indicated in Figure 6. Hence it is difficult for these $b$-jets
to be identified. Even if these $b$-jets are tagged, we are still confronted by a combinatorial
background caused by $b$-jets from Higgs decays, which are typically central and hence easier
to tag.
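The interplay between forward $b$-jets and tracker coverage can be made concrete with a small sketch. It uses the pseudorapidity of a jet momentum as a stand-in for the rapidity discussed above (the two coincide for massless jets); the function names are hypothetical, and the coverages 2.5 and 3.5 are the LHC-like and assumed 100 TeV values quoted in the text.

```python
import math


def pseudorapidity(px, py, pz):
    """Pseudorapidity eta = atanh(pz / |p|) of a momentum vector."""
    p = math.sqrt(px * px + py * py + pz * pz)
    if px == 0.0 and py == 0.0:
        raise ValueError("eta is undefined for zero transverse momentum")
    return math.atanh(pz / p)


def delta_eta(eta_1, eta_2):
    """Absolute rapidity difference between the two accompanying jets."""
    return abs(eta_1 - eta_2)


def inside_tracker(eta, coverage):
    """True if the jet lies within the tracker, so b-tagging is applicable."""
    return abs(eta) < coverage


if __name__ == "__main__":
    forward = pseudorapidity(40.0, 0.0, 40.0 * math.sinh(3.0))    # eta close to +3
    backward = pseudorapidity(40.0, 0.0, -40.0 * math.sinh(2.0))  # eta close to -2
    print(delta_eta(forward, backward))
    print(inside_tracker(forward, 2.5), inside_tracker(forward, 3.5))
```

A jet at $|\eta| \approx 3$ falls outside an LHC-like tracker but inside the wider coverage assumed for the 100 TeV detector, so only the latter could use it as a tagged accompanying $b$-jet.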

Addressing these difficulties is not easy. However, it provides an opportunity to explore
potential guidelines for optimizing the detector design of a 100 TeV pp-collider, as well as
for searching for heavy resonances at such a collider, given the essential role played by an
extended Higgs sector in particle physics. In this article, we assume a tracker coverage of
$|\eta| < 3.5$ for future 100 TeV collider detectors. In response to the boostness kinematics and
in order to suppress combinatorial background, we apply a multivariate approach by using
a BDT. More explicitly, we apply a BDT in each reconstruction step, aiming for a full event
reconstruction. Below are the steps:

1. Construct the bottom BDT. With a BDT method, we are able to define $b$-like jets (similarly
for $t$-like jets), which are characterized by their likelihood to be a $b$-jet.

2. Construct the top BDTs: one is hadronic and the other one is leptonic. We build up
BDT-based jet-taggers for boosted tops, which mostly originate from the Higgs decays
(cf. Figure 5c). In the case of less boosted tops we cannot expect all top decay
products to lie inside a jet cone and we have to reconstruct the $W$-bosons before we
can use these to reconstruct the tops.

3. Construct the Higgs BDT using the reconstructed tops or one reconstructed top and
one $b$-like jet for the neutral or charged Higgs, respectively.

4. Construct a BDT to demand two $b$-like jets (or one top-like and one bottom-like jet)
with large $\Delta\eta$ which accompany the Higgs production. We name it the “bottom-fusion”
BDT, though such kinematics may originate from a process other than bottom quark
fusion.

5. Construct the BDT for the whole event by properly combining the Higgs BDT and
the bottom-fusion BDT.

(a) BDT-based event reconstruction in the case of boosted tops.
(b) BDT-based event reconstruction in the case of non-boosted tops.

Figure 7: Steps undertaken during the BDT-based reconstruction of the signal
$pp \to H/A\,b\bar{b} \to t\bar{t}b\bar{b}$. Based on the EFlow output from Delphes we cluster jets with FastJet.
Solid lines depict a reconstruction step where we combine more than one object to form a new
object. Dashed lines symbolize tagging steps.

For illustration, the reconstruction steps undertaken for the $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ events
with a heavily boosted top pair and an unboosted top pair are depicted in Figure 7a and in
Figure 7b, respectively. The transition from boosted to unboosted events is gradual, which
is taken into consideration by the top BDT.
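To make the notion of a BDT response concrete, the following toy sketch boosts single-variable cuts (decision stumps) with AdaBoost, one common boosting scheme. It is not the TMVA configuration or the BoCA code used in this article; the two features (a jet mass and a rapidity difference), their Gaussian shapes and all function names are invented for illustration.

```python
import math
import random


def make_toy_sample(n_per_class=100, seed=1):
    """Toy objects with two hypothetical features: (mass in GeV, delta eta)."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n_per_class):
        X.append((rng.gauss(173.0, 15.0), rng.gauss(4.0, 1.0)))  # signal-like
        y.append(+1)
        X.append((rng.gauss(80.0, 25.0), rng.gauss(1.5, 1.0)))   # background-like
        y.append(-1)
    return X, y


def stump_predict(stump, x):
    """A stump is (feature index, threshold, sign): one cut on one variable."""
    feature, threshold, sign = stump
    return sign if x[feature] > threshold else -sign


def train_stump(X, y, w):
    """Scan all cuts and return the one with the lowest weighted error."""
    best_err, best_stump = float("inf"), None
    for feature in range(len(X[0])):
        for threshold in sorted({x[feature] for x in X}):
            for sign in (+1, -1):
                stump = (feature, threshold, sign)
                err = sum(wi for xi, yi, wi in zip(X, y, w)
                          if stump_predict(stump, xi) != yi)
                if err < best_err:
                    best_err, best_stump = err, stump
    return best_err, best_stump


def train_bdt(X, y, n_rounds=10):
    """AdaBoost: reweight misclassified events and add one stump per round."""
    w = [1.0 / len(X)] * len(X)
    ensemble = []
    for _ in range(n_rounds):
        err, stump = train_stump(X, y, w)
        if err >= 0.5:          # no cut better than random guessing is left
            break
        err = max(err, 1e-12)   # avoid log(0) for a perfect cut
        alpha = 0.5 * math.log((1.0 - err) / err)
        ensemble.append((alpha, stump))
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, xi))
             for xi, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    if not ensemble:
        raise ValueError("no useful stump found")
    return ensemble


def bdt_response(ensemble, x):
    """Weighted vote of all stumps, normalised to the range [-1, 1]."""
    norm = sum(alpha for alpha, _ in ensemble)
    return sum(alpha * stump_predict(stump, x) for alpha, stump in ensemble) / norm


def accuracy(ensemble, X, y):
    """Fraction of events whose response sign matches the true label."""
    hits = sum((bdt_response(ensemble, xi) > 0.0) == (yi > 0) for xi, yi in zip(X, y))
    return hits / len(X)


if __name__ == "__main__":
    X, y = make_toy_sample()
    bdt = train_bdt(X, y)
    print(len(bdt), accuracy(bdt, X, y))
```

In a chained reconstruction such as the one outlined above, the response of one classifier (for example a top tagger) would enter as an input variable of the next one (for example the Higgs BDT); the toy only shows a single stage.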

In the large $\tan\beta$ region, the decays into $\tau$-leptons contribute to the searches for both
neutral and charged Higgs bosons. We require the two $\tau$ leptons in $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$
to decay semi-leptonically. In $pp \to tbH^\pm \to tb\tau\nu$, we let both the top quark and the $\tau$ decay
hadronically. Then we can use hard leptons from the $\tau$ decay or large transverse mass to
suppress backgrounds in these two cases, respectively. Given their relatively simple collider
kinematics, we apply a cut-based method in these two cases.

Our analysis framework is defined in the following. We use SoftSusy 3.5.1 [48] as the
generator for the MSSM Higgs spectrum and simulate all events with MadGraph. The
subsequent decays are performed by Pythia 6.426 [49]. We use Delphes 3.1.2 [50] to
simulate a detector with CMS geometry (except using a tracker coverage $|\eta| < 3.5$ for the
100 TeV pp-collider), with pile-up turned off. For LHC simulations we keep the default
values. For cut-based analyses we depend on Delphes jet-tagging, which we tune according
to Drell-Yan samples with a transverse momentum cut of 40 GeV. The $b$-tagging efficiency is
70% with 20% misidentification of $c$-jets. The $\tau$-tagging efficiency is 60% and the fake rate
is 1%. For BDT-based analyses we cluster jets based on Delphes energy flow observables
with FastJet 3.0.6 [51], with a radius of 0.5. In this case we switch off lepton isolation
requirements, and instead require that the leading lepton has a transverse momentum
larger than 50 GeV and 100 GeV, and that the missing energy is larger than 30 GeV and
60 GeV, for the LHC and the 100 TeV collider, respectively. We demand that jets have a
transverse momentum larger than 20 GeV and 40 GeV for the LHC and the 100 TeV collider,
respectively, in all cases. In order to reduce the number of background events, we apply
a pre-cut on the top or lepton transverse momenta or the missing transverse energy (cf.
Table 3). The background cross-sections after all pre-cuts are summarized in Table 3. We
use the TMVA 4.2.0 package [52] of the ROOT 5.34 framework [53] for BDT analyses.
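The object thresholds quoted above for the BDT-based analyses can be collected in a small configuration, sketched here for illustration. The class, field and dictionary key names are hypothetical and do not correspond to the Delphes cards or the BoCA code; only the numerical thresholds are taken from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PreSelection:
    """Minimum thresholds in GeV for the BDT-based analyses."""
    lepton_pt_min: float
    missing_et_min: float
    jet_pt_min: float


LHC_14TEV = PreSelection(lepton_pt_min=50.0, missing_et_min=30.0, jet_pt_min=20.0)
FUTURE_100TEV = PreSelection(lepton_pt_min=100.0, missing_et_min=60.0, jet_pt_min=40.0)


def select_jets(jet_pts, cuts):
    """Keep only jets above the transverse momentum threshold."""
    return [pt for pt in jet_pts if pt > cuts.jet_pt_min]


def passes_lepton_and_met(event, cuts):
    """Leading-lepton and missing-energy requirements (no lepton isolation)."""
    return (event["leading_lepton_pt"] > cuts.lepton_pt_min
            and event["missing_et"] > cuts.missing_et_min)


if __name__ == "__main__":
    event = {"leading_lepton_pt": 60.0, "missing_et": 35.0, "jet_pts": [15.0, 25.0, 45.0]}
    for cuts in (LHC_14TEV, FUTURE_100TEV):
        print(passes_lepton_and_met(event, cuts), select_jets(event["jet_pts"], cuts))
```

Keeping the two collider setups as separate immutable objects makes it explicit that the same selection logic is applied with doubled thresholds at 100 TeV.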

3.2 BDT-based Top-jet Taggers 

Each top BDT comprises a boosted top-jet tagger and modules for the reconstruction of
unboosted tops. We apply both techniques to each top and rely on the technique with the
better result, i.e. we choose the top jet with the best BDT response. The construction
of top-jet taggers plays a crucial role in exploring physics with boosted tops. Thus we
introduce in this section how to construct the hadronic and leptonic top-jet taggers, using a
BDT method.¹

For heavily boosted hadronic top jets the detector resolution may not allow for using
sub-jet information. Hence we rely solely on the jet mass and variables making use of the
secondary vertex information (cf. Appendix B.1). In order to suppress the fake rate of
leptonically decaying tops in the hadronic top tagger, we veto against a hard lepton inside
the jet cone. If the top jets are not too heavily boosted, we make use of the kinematics
of sub-jets [55], which are re-clustered using a $k_T$ jet algorithm in an exclusive way. We
consider the case where the top jet consists of two sub-jets which resemble a boosted $W$-jet
and a $b$-jet, respectively, and the case where the top jet consists of three sub-jets.² We
assume that the cell resolution of a typical detector is around $\Delta R \sim 0.1$, which imposes
an upper limit on the substructure resolution. We rely solely on traditional kinematic
variables of the jets and sub-jets. This can be further improved with variables based on
more advanced substructure information, such as pull [56] or di-polarity [57], which should
help to suppress the fake rate of jets with a different color-flow compared to top jets.

¹ During the preparation of this article a BDT-improved version of the HEPTopTagger has been
published [54].

² To build up the full top BDT, we incorporate two additional cases, where the reconstructed top consists
of three separated jets, and where the reconstructed top consists of two separated jets with one resembling a
boosted $W$-jet and another one resembling a $b$-jet.

(a) 700 GeV < $p_T^j$ (hadronic) < 1000 GeV
(b) 1000 GeV < $p_T^j$ (hadronic) < 1500 GeV
(c) 500 GeV < $p_T^j$ (leptonic) < 1000 GeV
(d) 1000 GeV < $p_T^j$ (leptonic) < 1500 GeV

Figure 8: Misidentification rate of the hadronic (a) and (b) as well as leptonic (c) and (d)
top-taggers as functions of the tagging rate. We consider only the case of highly boosted top jets and do
not use sub-jet information. Quark-jets (including $t$, $b$, $c$ and $q = u, d$) are produced in a Drell-Yan
process; $g$-jets are produced via QCD processes; the SM $h$-jets are produced via $gg \to hh$, with
Br($h \to b\bar{b}$) = 100% assumed; and $W$- and $Z$-jets are produced via di-boson production, and decay
in a standard way. All samples are subject to a pre-cut on their transverse momenta, and are
generated in the same size. We present two choices of pre-cuts for each tagger. In the hadronic
case we generate the jets with a parton level pre-cut of 500 GeV and 1000 GeV and additionally
require the jets to fall into the transverse momentum windows 700 GeV < $p_T^j$ < 1000 GeV and
1000 GeV < $p_T^j$ < 1500 GeV, respectively. In the leptonic case we generate the jets with a parton
level pre-cut of 500 GeV and 1000 GeV and additionally require the jets to fall into the transverse
momentum windows 500 GeV < $p_T^j$ < 1000 GeV and 1000 GeV < $p_T^j$ < 1500 GeV, respectively. We
do not consider the effects caused by pile-up.

In the case of leptonic top jets, it becomes crucial to identify the lepton together with
the hadronic activity. As the lepton is not isolated, it has to serve as the foundation for
a boosted leptonic top tagger [58], which poses more challenges in the case of electrons
compared to muons. The development of the necessary non-isolated lepton taggers lies outside
the scope of this paper. Therefore, we have assumed that it will be possible to identify
leptons in a hadronic environment.

We have tested these top taggers on various backgrounds. The resulting misidentification
rates as a function of the top-tagging rate are presented in Figure 8. For each tagger we
present the misidentification rate in two bins, in which all jets are generated with the same
parton level pre-cut of $p_T^j > 500$ GeV and $p_T^j > 1000$ GeV, respectively. Additionally, we
require the jets to fall into a transverse momentum window of 500 (700) GeV < $p_T^j$ < 1000 GeV
for the leptonic (hadronic) tagger and 1000 GeV < $p_T^j$ < 1500 GeV, respectively. We apply a
slightly different lower bound on the lower of the two $p_T$ windows for the leptonic (500 GeV)
and hadronic (700 GeV) taggers in order to compensate for the missing transverse momentum
in the leptonic case. For the hadronic
top tagger the most important background consists of Higgs jets followed by $b$, $c$ and $h$ jets.
For the leptonic top tagger the most important backgrounds are hadronic top and $b$ jets.
We stress that the fake rates shown in Figure 8 are defined inclusively, hence the BDT
background is defined as a combination of all listed background jets. At an exclusive level,
the fake rates can be further optimized.

4 Prospects at 14 and 100 TeV pp Colliders 

The exclusion limits and discovery reaches obtained from these channels at the LHC
and at a 100 TeV $pp$ collider with a CMS-like detector (a tracker coverage of $|\eta| < 3.5$ is
assumed for the 100 TeV machine) are presented in Figure 9 and Figure 10. We provide
limits for the LHC with an integrated luminosity of 0.3 ab$^{-1}$ and 3 ab$^{-1}$, which correspond
to the data expected after the third run and after the upgrade to the HL-LHC,
respectively. For the 100 TeV collider we show the contours for a luminosity of 3 ab$^{-1}$ and
30 ab$^{-1}$. Though systematic errors are not considered, we believe that incorporating them
will not qualitatively change the conclusions reached in this paper.

To allow a straightforward interpretation of these results in other extended Higgs
sectors, e.g., the 2HDM, we present the exclusion limits and discovery reaches
of the channels $b\bar{b}H/A \to b\bar{b}t_h t_l$ and $tbH^\pm \to t_h b t_l b$ at 14 TeV and 100 TeV in Figure 9 in a
model-independent way. A generic feature of these sensitivity reaches is that they tend
to be higher for a larger $m_A$ or $m_{H^\pm}$ within the range under exploration. This is simply
because the boosted kinematics plays a crucial role in suppressing the background.

Figure 9: Model-independent reaches of $b\bar{b}H/A \to t_h t_l b\bar{b}$ in (a) and (c) as well as $tbH^\pm \to t_h b t_l b$
in (b) and (d). In the upper panels (a) and (b) the energy is 100 TeV. In the lower panels (c) and (d)
the energy is 14 TeV. The luminosity in all panels is 3 ab$^{-1}$. The green area represents the discovery
reach and the orange band shows the exclusion reach. The constraints are weaker at the 100 TeV
collider compared to the LHC because of the larger background cross-sections.

(c) Exclusion reach for neutral Higgs (d) Exclusion reach for charged Higgs

Figure 10: Discovery reaches (a) and (b) and exclusion limits (c) and (d) of the MSSM Higgs
bosons. For the neutral Higgs bosons (a) and (c), the blue and orange regions are probed by
the channels $pp \to b\bar{b}H/A \to b\bar{b}\tau_h\tau_l$ and $pp \to b\bar{b}H/A \to b\bar{b}t_h t_l$, respectively. For the charged Higgs
bosons (b) and (d), the blue and orange regions are probed by the channels $pp \to tbH^\pm \to tb\tau_h\nu$
and $pp \to tbH^\pm \to t_h b t_l b$, respectively. For a given channel (that is, a given color), the two different
opacities indicate the sensitivities for a luminosity of 3 ab$^{-1}$ and 30 ab$^{-1}$ at a 100 TeV $pp$
collider, respectively. The cross-hatched and diagonally hatched regions are the predicted discovery
contours (or exclusion contours) for associated Higgs production at the LHC for 0.3 ab$^{-1}$ and 3 ab$^{-1}$,
respectively.

The interpretation of these results in the MSSM is presented in Figure 10. For
moderate $\tan\beta$, the cross-sections for the associated charged Higgs channel (cf. Figure 3c)
and the associated neutral Higgs channel (cf. Figure 3b) are comparable to each other.
Therefore, the analyses reach a roughly comparable sensitivity for this parameter point,
given that they share similar backgrounds for the setup under consideration. In the case of
the charged Higgs the cross-section and the sensitivity increase when $\tan\beta$ moves away from
this point. In the case of the neutral Higgs the cross-section and the sensitivity decrease when
$\tan\beta$ moves away from this point.
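
The opposite trends described above can be made plausible with a simplified tree-level scaling. The following sketch is our own illustration and not part of the analysis: it assumes decoupling-limit Yukawa couplings proportional to $m_t\cot\beta$ and $m_b\tan\beta$, neglects phase space, running masses and all decay modes other than $t\bar{t}$, $b\bar{b}$ and $tb$, and uses illustrative quark masses that are not taken from this paper.

```python
import math

def charged_rate(tan_beta, m_t=173.0, m_b=4.2):
    """Relative tbH+- rate: squared H+-tb coupling, m_t^2 cot^2(beta) + m_b^2 tan^2(beta)."""
    return (m_t / tan_beta) ** 2 + (m_b * tan_beta) ** 2

def neutral_rate(tan_beta, m_t=173.0, m_b=4.2):
    """Relative bbH/A -> bbtt rate: production ~ (m_b tan(beta))^2 times an
    approximate Br(tt) = top term / (top term + bottom term)."""
    top = (m_t / tan_beta) ** 2
    bottom = (m_b * tan_beta) ** 2
    return bottom * top / (top + bottom)

def scan(rate, lo=1.0, hi=60.0, steps=5901):
    """Evaluate a rate on an evenly spaced tan(beta) grid (step 0.01 by default)."""
    grid = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return grid, [rate(tb) for tb in grid]

grid, charged = scan(charged_rate)
_, neutral = scan(neutral_rate)

tb_min_charged = grid[charged.index(min(charged))]  # charged rate is smallest here
tb_max_neutral = grid[neutral.index(max(neutral))]  # neutral rate is largest here
tb_star = math.sqrt(173.0 / 4.2)                    # analytic extremum sqrt(m_t / m_b)
```

Under these assumptions both extrema sit at $\tan\beta = \sqrt{m_t/m_b}$, so the charged rate grows and the neutral rate falls on either side of the same moderate $\tan\beta$ value.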

For the energies reachable by the LHC and a luminosity of 3 ab$^{-1}$, we are able to
exclude the moderate $\tan\beta$ region up to $\sim 1$ TeV, via $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ for neutral
Higgs bosons and via $pp \to tbH^\pm \to tbtb$ for charged Higgs bosons. The search for low
mass charged Higgs bosons in the moderate $\tan\beta$ region is strongly affected by the pre-cuts
on lepton transverse momentum and missing transverse energy, which leaves a
small unprobed area that we hope to cover with a slightly improved analysis. This is
indicated in Figure 10. Combining with $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$ and $pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$ (or
$pp \to H/A \to t\bar{t}$), a full coverage of $\tan\beta$ might be achievable for neutral Higgs searches,
though a dedicated analysis is yet to be done for the latter. As for the charged Higgs
searches, the $pp \to H^\pm tb \to ttbb$ channel additionally covers the lower and higher $\tan\beta$
regions up to more than 2 TeV. The discovery regions are more tightly constrained: at the
HL-LHC the charged Higgs can be discovered in the associated channel for high and low
$\tan\beta$ up to $\sim 2$ TeV.

At the 100 TeV collider, a combination of $pp \to b\bar{b}H \to b\bar{b}\tau\tau$ and $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$
pushes the exclusion limit for the neutral Higgs searches up to $m_A \sim 10$ TeV, except for the
low $\tan\beta$ region. For $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$, the low mass region has a worse sensitivity than
the intermediate mass region, due to the pre-cuts on the lepton transverse momentum and the
missing transverse energy. In Figure 10, we also present the sensitivity reach of the $t_h t_l$
resonance search for reference (dashed red curves), with the interference effect between signal
and background ignored. However, it should not be interpreted as the real reach of the neutral
Higgs searches in the channel $pp \to H/A \to t\bar{t}$ at a 100 TeV collider, since the interference
effect can dramatically change the resonance structure. Concurrently, $pp \to tbH^\pm \to tbtb$
pushes the exclusion limit for the charged Higgs searches up to $m_{H^\pm} \sim 10$ TeV with a full
coverage of $\tan\beta$, with additional coverage up to $m_{H^\pm} \sim 20$ TeV for both the high and low
$\tan\beta$ regions. Discovery of neutral and charged Higgs bosons will be possible up to 10 TeV and
10-20 TeV, respectively. In summary, with the channels under study, the HL-LHC and the
100 TeV $pp$ collider have the potential to push the sensitivity reach for $H/A$ and $H^\pm$ up to
scales of $\mathcal{O}(1)$ TeV and $\mathcal{O}(10)$ TeV, respectively.

5 Summary and Outlook 

The BSM Higgs sector is one of the most important physics targets at the LHC and at 
next-generation pp colliders. In this article, we present a systematic study testing the 
MSSM Higgs sector in the decoupling limit, at 14 TeV and 100 TeV machines. We propose 
that the “top” decay channels ($H/A \to t\bar{t}$ and $H^\pm \to tb$) in associated Higgs production
should play an essential role. These channels are typically characterized by kinematics with
highly boosted Higgs decay products and a large forwardness/backwardness of the particles
accompanying the Higgs production ($pp \to t\bar{t}H/A$ is an exception, where $t\bar{t}$ is less forward
or backward). Facilitated by a BDT method, the LHC will be able to cover the “wedge”
region for neutral Higgs searches and hence test the Higgs sector up to $\mathcal{O}(1)$ TeV for both
the high and moderate $\tan\beta$ regions. We show that a future 100 TeV $pp$ collider has the potential
to push the sensitivity reach up to $\mathcal{O}(10)$ TeV.

The analyses pursued in this article represent a preliminary effort in this regard. A 
more complete study is definitely necessary. Below are several directions that are interesting 
to explore in our view: 

• Although the channel $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ enables us to cover the “wedge” or
moderate $\tan\beta$ region for neutral Higgs searches in the decoupling limit, its sensitivity
is reduced below or around the threshold $m_H/m_A = 2m_t$, where this channel either
becomes kinematically forbidden at an on-shell level or yields soft decay products.
However, we may adopt a similar strategy, combining the $b\bar{b}H/A$ production with some
other dominant decay modes in the low $\tan\beta$ region, such as $H \to hh$ and $A \to hZ$. Then
the unprobed “wedge” region might be covered by the channels $pp \to b\bar{b}H \to b\bar{b}hh$
and $pp \to b\bar{b}A \to b\bar{b}hZ$.

• A realistic analysis for probing the low $\tan\beta$ region in the neutral Higgs searches
is still absent, largely because of the large interference effect between the
$pp \to H/A \to t\bar{t}$ signal and the QCD $t\bar{t}$ background. To address this question, developing
new strategies is necessary and important. One possible way out is to
search for $pp \to t\bar{t}H/A \to t\bar{t}t\bar{t}$. Although its cross-section is relatively small compared
to the other channels, the strategies developed in this article might be of help. Both
topics are under exploration. The results will be presented in a future article [59].

• Although the studies in this article are focused on the MSSM Higgs sector, it is
straightforward to project their sensitivity reach to other BSM scenarios, such as
the 2HDM. Additionally, the exploration can be generalized to exotic search channels.
Such channels might be switched on in a 2HDM, but are generically suppressed
in the MSSM [60-63]. Dedicated analyses are certainly necessary. We leave this
exploration to future work.

Note added: While this article was being finalized, the papers [64, 65] appeared, which
partially overlap with this one in evaluating the LHC sensitivities in searching for
$pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ and $pp \to tbH^\pm \to tbtb$. However, we notice a difference between the LHC
sensitivities obtained in [65] and in our analyses, which enable us to conclude that the
LHC has the potential to fill the well-known “wedge” (that is, the region with moderate
$\tan\beta$) up to $\sim 1$ TeV, with 3000 fb$^{-1}$, via such channels. In spite of this, we are focused
more on testing an extended Higgs sector at a future 100 TeV $pp$ collider, exploring the
collider kinematics involved and developing its (BDT-based) search strategies, which are
not covered in [65].

A Background Generation


All background cross-sections are calculated with MadGraph and given to leading order. In
order to reduce the size of the background samples that have to be generated, we apply different
pre-cuts depending on the Higgs mass.

For the $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ and $pp \to tbH^\pm \to tbtb$ analyses the inclusive $t\bar{t}$ background
is generated with a pre-cut on the top transverse momentum of $p_T(t) = 300$, $1500$, $2500$ GeV
for Higgs masses equal to or larger than 1000, 4000, 6000 GeV, respectively. For the LHC analyses
we have applied a pre-cut of 250 GeV for masses equal to or larger than 1000 GeV.

In the $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$ analysis the irreducible background $b\bar{b}Z/\gamma^* \to b\bar{b}\tau\tau$ and
the reducible backgrounds $c\bar{c}Z/\gamma^* \to c\bar{c}\tau\tau$, $t\bar{t} \to b\bar{b}\tau\tau\nu\nu$ and $t\bar{t} \to b\bar{b}l\tau\nu\nu$ are considered.
We generate background samples with a pre-cut on the lepton transverse momentum of
150 GeV for Higgs masses equal to or larger than 1000 GeV, and 700 GeV for Higgs masses
equal to or larger than 3000 GeV.

For the analysis $pp \to btH^\pm \to bt\tau\nu$, the irreducible background is $t\bar{t} \to b\bar{b}jj\tau\nu$ and
the reducible backgrounds are $t\bar{t} \to b\bar{b}\tau\tau\nu\nu$ and $t\bar{t} \to b\bar{b}l\tau\nu\nu$. The background $t\bar{t} \to b\bar{b}jj\tau\nu$
can be suppressed by requiring a large missing transverse energy together with a
large transverse mass. Hence we generate $t\bar{t} \to b\bar{b}jj\tau\nu$ with a missing transverse energy larger
than 300 GeV for Higgs masses larger than 1 TeV; for Higgs masses larger than 3 TeV,
the irreducible background is well suppressed. For $t\bar{t} \to b\bar{b}\tau\tau\nu\nu$ and $t\bar{t} \to b\bar{b}l\tau\nu\nu$, the
missing transverse energy is required to be equal to or larger than 300 GeV and 600 GeV for
Higgs masses larger than 1 TeV and 3 TeV, respectively. All background cross-sections and
the generated luminosities are collected in Table 3.
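
As a consistency check on how the generated luminosity relates to the sample size, the sketch below assumes $L_{\text{gen}} = N_{\text{gen}}/\sigma$ and combines the 100 TeV cross-sections of Table 3 (lepton pre-cut of 700 GeV) with the generated event numbers listed in Table 4. The function name and the dictionary layout are illustrative choices, not part of the analysis code.

```python
def generated_luminosity(n_events, sigma_pb):
    """Equivalent integrated luminosity in fb^-1 of n_events generated
    for a process with cross-section sigma_pb (in pb)."""
    sigma_fb = sigma_pb * 1000.0  # 1 pb = 1000 fb
    return n_events / sigma_fb

# process: (N_gen from Table 4, sigma [pb] from Table 3, L_gen [fb^-1] from Table 3)
samples = {
    "bbZ/gamma* -> bb tau tau": (10000, 0.000294, 34000.0),
    "ccZ/gamma* -> cc tau tau": (10000, 0.000234, 42700.0),
    "tt -> bb tau tau nu nu": (7686, 0.00187, 4120.0),
    "tt -> bb l tau nu nu": (19043, 0.00736, 2590.0),
}

# Relative deviation between the computed and the tabulated luminosity.
deviations = {
    name: generated_luminosity(n, sigma) / l_table - 1.0
    for name, (n, sigma, l_table) in samples.items()
}
```

The computed values agree with the tabulated luminosities to within the rounding of the quoted cross-sections.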

B BDT-based Analyses 

The analyses for $pp \to b\bar{b}H/A \to b\bar{b}t\bar{t}$ and $pp \to tbH^\pm \to tbtb$ are based on BDTs.

B.1 Boosted Decision Trees (BDT)

p_T(t)  p_T(l)  E_T^miss   σ(100 TeV)   σ(14 TeV)   L_gen(100 TeV)   L_gen(14 TeV)
[GeV]   [GeV]   [GeV]      [pb]         [pb]        [fb^-1]          [fb^-1]

Signal $H/A\,b\bar{b} \to t\bar{t}b\bar{b}$, background $t\bar{t} \to b\bar{b}jjl\nu$
2500    -       -          0.0444       -           1570             -
1500    -       -          0.505        -           384              -
300     -       -          356          -           5.71             -
250     -       -          -            9.16        -                76.2
0       -       -          7130         166         0.384            10.8

Signal $H^\pm tb \to ttbb$, background $t\bar{t} \to b\bar{b}jjl\nu$
2500    -       -          0.0884       -           3453             -
1500    -       -          0.894        -           255              -
300     -       -          375          -           3.12             -
250     -       -          -            7.05        -                141
0       -       -          7130         166         0.384            10.8

Signal $H/A\,b\bar{b} \to \tau\tau b\bar{b}$, background $b\bar{b}Z/\gamma^* \to b\bar{b}\tau\tau$
-       700     -          0.000294     -           34000            -
-       150     -          0.109        -           4600             -
-       0       -          37.4         -           13.4             -

Signal $H/A\,b\bar{b} \to \tau\tau b\bar{b}$, background $c\bar{c}Z/\gamma^* \to c\bar{c}\tau\tau$
-       700     -          0.000234     -           42700            -
-       150     -          0.0918       -           5450             -
-       0       -          65.8         -           7.60             -

Signal $H/A\,b\bar{b} \to \tau\tau b\bar{b}$, background $t\bar{t} \to b\bar{b}\tau\tau\nu\nu$
-       700     -          0.00187      -           4120             -
-       150     -          0.956        -           490.0            -
-       0       -          135.0        -           4.76             -

Signal $H/A\,b\bar{b} \to \tau\tau b\bar{b}$, background $t\bar{t} \to b\bar{b}l\tau\nu\nu$
-       700     -          0.00736      -           2590             -
-       150     -          3.74         -           184.0            -
-       0       -          600.0        -           2.13             -

Signal $H^\pm \to \tau\nu$, background $t\bar{t} \to b\bar{b}jj\tau\nu$
-       -       300        4.74         -           26.1             -
-       -       0          428.0        -           1.71             -

Signal $H^\pm \to \tau\nu$, background $t\bar{t} \to b\bar{b}\tau\tau\nu\nu$
-       -       600        0.0886       -           1990             -
-       -       300        0.943        -           117.0            -
-       -       0          135.0        -           4.76             -

Signal $H^\pm \to \tau\nu$, background $t\bar{t} \to b\bar{b}l\tau\nu\nu$
-       -       600        0.388        -           1540             -
-       -       300        4.32         -           36.8             -
-       -       0          600.0        -           2.13             -

Table 3: Cross-sections and generated luminosities of the relevant background processes after pre-cut.
All $t\bar{t}$ backgrounds are matched up to two jets including $b$-jets. The pre-cut $p_T(t)$ has been applied
to both top quarks or to the leading top quark and the leading non-top quark for neutral or charged
Higgs, respectively. The pre-cut $p_T(l)$ has been applied to all leptons in the event, including $\tau$-leptons. In
our analysis we apply a k-factor of 1.2 to the LHC cross-sections [47].

A decision tree consists of a series of cuts classifying an event as either signal or
background. Subsequent cuts of the decision tree are also applied to events previously
classified as background, yielding shapes in the parameter space which approximate the
signal region much better than the usual rectangular cuts. The decision tree is trained
on a training sample with truth level information; in a second stage it is applied to the
testing sample without truth level information. A decision tree can lead to a perfect
separation between signal and background in the training sample. However, this comes with
an over-training effect, where subsidiary cuts deal only with statistical fluctuations of the training
sample. This can be seen by comparing the efficiency of the decision tree between the
training sample and a test sample. Boosted decision trees are an enhancement of the decision
tree that deals with this problem. While the basic decision tree algorithm stays the same,
it is applied multiple times on re-weighted samples. The weighting factors are calculated
from the region in parameter space where the decision tree has the lowest discriminating
power. BDT techniques were introduced to high energy physics in [66] and have since
then seen a large adoption, especially in the experimental community, including [67-71]. For
a short introduction see [72]. We make use of the TMVA package of the ROOT framework. We
use the default AdaBoost algorithm and apply bagging in order to further reduce
over-training. Our analysis code BoCA 0.1 is publicly available [33].
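
The re-weighting idea can be illustrated with a minimal, self-contained sketch. It implements textbook discrete AdaBoost on single-cut trees (stumps) in pure Python; it is not the TMVA implementation used in our analysis, it omits bagging, and all function names as well as the toy sample are hypothetical choices made for illustration.

```python
import math

def stump_predict(stump, x):
    """A stump is a single cut: (feature index, threshold, sign)."""
    feature, threshold, sign = stump
    return sign if x[feature] > threshold else -sign

def train_stump(xs, ys, weights):
    """Return the single cut with the lowest weighted misclassification rate."""
    best_error, best_stump = None, None
    for feature in range(len(xs[0])):
        for threshold in sorted({x[feature] for x in xs}):
            for sign in (1, -1):
                stump = (feature, threshold, sign)
                error = sum(w for x, y, w in zip(xs, ys, weights)
                            if stump_predict(stump, x) != y)
                if best_error is None or error < best_error:
                    best_error, best_stump = error, stump
    return best_error, best_stump

def train_adaboost(xs, ys, n_trees=80):
    """Train n_trees stumps, re-weighting the sample after each one."""
    weights = [1.0 / len(xs)] * len(xs)
    forest = []
    for _ in range(n_trees):
        error, stump = train_stump(xs, ys, weights)
        error = min(max(error, 1e-12), 1.0 - 1e-12)  # guard the logarithm
        alpha = 0.5 * math.log((1.0 - error) / error)
        forest.append((alpha, stump))
        # Misclassified events gain weight, correctly classified ones lose it.
        weights = [w * math.exp(-alpha * y * stump_predict(stump, x))
                   for x, y, w in zip(xs, ys, weights)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return forest

def bdt_response(forest, x):
    """Weighted vote of all stumps, normalized to the interval [-1, 1]."""
    norm = sum(alpha for alpha, _ in forest)
    return sum(alpha * stump_predict(s, x) for alpha, s in forest) / norm

# Toy sample: signal (+1) populates a window that no single cut can isolate.
xs = [(i / 40.0,) for i in range(40)]
ys = [1 if 12 <= i < 28 else -1 for i in range(40)]

single_cut_error, _ = train_stump(xs, ys, [1.0 / 40] * 40)
forest = train_adaboost(xs, ys)
misclassified = sum(1 for x, y in zip(xs, ys) if y * bdt_response(forest, x) <= 0)
```

On this toy sample the best single cut misclassifies 30% of the events, while the boosted ensemble separates the training sample completely. This also illustrates why the response has to be checked on an independent test sample.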

In order to classify the events of the training sample as signal or background we identify
jets with their nearest quark. Additionally, we define a set of observables according to
which the BDT is trained. In the testing and application phase the BDT is applied to a
sample without truth level information and returns the signal likeliness of each element
in this sample. We combine multiple BDTs tailored to reconstruct single particles into
a chain, which we apply to our events. In a first step, we construct a simple bottom jet
BDT tagger. For boosted leptonic and hadronic top jets we develop two BDT taggers. For
non-boosted tops, on the other hand, we first reconstruct hadronic and leptonic $W$-bosons;
subsequently we combine these $W$-bosons with a $b$-jet. In the next step we reconstruct
neutral or charged Higgs bosons from two tops or from one $b$-jet and one top, respectively. Additionally,
we search for the bottom-fusion jet pairs with a large difference in rapidity. Finally, we
combine the reconstructed Higgs and the bottom-fusion pair and distinguish signal and
background events with a last BDT analysis.

1. Bottom Tagger 

In order to distinguish bottom jets from light jets we make use of the finite lifetime of
the intermediate bottom mesons. A bottom jet has multiple displaced vertices with a
larger invariant mass and energy fraction compared to light jets. Therefore, we train
the bottom jet BDT on

Vertex Mass: invariant mass of tracks with a displaced vertex, $m_V$

Multiplicity: number of tracks with displacement, $N_t$

Displacement: cylindrical distance between the primary vertex and the secondary
vertices, $\Delta r$

Energy fraction between tracks with and without displacement, $r_E$

Radius of the jet associated with a secondary vertex, $\Delta R_j$

Spread: concentration of transverse momentum towards the center of the jet,
$\sum_{c \in j} \Delta R(c, j)\, p_T^c \,/\, \sum_{c \in j} p_T^c\, \Delta R_j$

2. Top Tagger 

For highly boosted tops we identify the top jets solely based on the global jet variables,
the information originating in the displacement of the $b$-mesons and the possible
presence of a lepton inside the jet cone. In addition to the variables used for the
bottom tagger we use

Mass of the jet

Hardness of the leading lepton inside the jet cone in the case of leptonically decaying
tops. For hadronically decaying tops we veto on hard leptons inside the jet cone.

(a) 500 GeV < $p_T^j$ < 1000 GeV (b) 1000 GeV < $p_T^j$ < 1500 GeV

Figure 11: BDT distributions for leptonic top. We present signal (solid) and background (dotted) for
two different transverse momentum regions. The low $p_T$ region comprises 500 GeV < $p_T^j$ < 1000 GeV
and is presented in panel (a). The high $p_T$ region comprises $p_T^j$ > 1000 GeV and is presented in
panel (b). All samples have a size of 10000. For their definition, see the caption of Figure 8.


Figure 11 shows the BDT response of the leptonic top tagger to different jets with
the same boostness. The light-, $W$- and $Z$-jet fake rate is mostly suppressed by the
displacement information. The track multiplicity and vertex mass reduce the fake
rate for $h$-jets, as the main decay channel into $b$-quarks yields more secondary vertices
compared to a leptonic top jet. The jet mass distributions of the heavy quarks and bosons are
centered around the respective quark and boson masses, which reduces the fake rates
of these jets further. On the other hand, the jet masses of the light quarks and gluons
form a very broad slope, maximized at zero, which makes it hard to suppress the fake
rate of these jets with the jet mass information.

(a) 700 GeV < $p_T^j$ < 1000 GeV (b) 1000 GeV < $p_T^j$ < 1500 GeV

Figure 12: BDT distribution of the hadronic top tagger. We present signal (solid) and background
(interrupted) for two different ranges of transverse momenta. The particles have been generated
with a transverse momentum pre-cut of 500 and 1000 GeV, respectively. Additionally the jets are
required to fall into the transverse momentum windows given in the panel labels. All samples have a size
of 10000. For their definition, see the caption of Figure 8.

In Figure 12 we present the BDT response of the hadronic top tagger to different jets
with the same boostness. Also in this case the main discriminator against light-, $W$-
and $Z$-jets is the presence of secondary vertices. Higgs jets are suppressed by the
energy fraction between tracks with and without secondary vertices. The presence of
a hard lepton suppresses the leptonic top. In the case of less boosted hadronic top
jets we make use of the information provided by the decay products. Hence, we probe
whether the jet can be re-clustered into two sub-jets to resolve the $b$ and $W$ components or
into three sub-jets to additionally resolve the decay products of the $W$.

3. Top reconstruction 

For non-boosted top jets we have to assume that the $W$ and the $b$ are located in two
or three different jets. Hence, we perform a $W$ reconstruction whenever necessary,
which combines two (sub-)jets and probes on

Invariant mass of the reconstructed object

Momentum-space position of the reconstructed object ($p_T$, $\eta$, $\phi$)

Angular difference between both elements of the reconstructed object ($\Delta\eta$, $\Delta\phi$,
$\Delta R$)

Momentum difference $\Delta p_T$

BDT response of the preceding reconstruction step; in the case of hadronic $W$
reconstruction this implies a veto on $b$-jets.

In a second step we combine the reconstructed or tagged $W$'s with a $b$-jet to form a
top and train the top BDT on the same variables as in the $W$-BDT.

Finally, we have to take the case of non-boosted leptonic tops into account. We consider
the hardest lepton as possibly originating from a $W$ decay. In order to reconstruct
these leptonically decaying $W$-bosons, we reconstruct the neutrino four-momentum
from the lepton momentum and the missing energy. Whenever the resulting quadratic
equation has complex solutions we iteratively move the missing energy vector towards
the lepton until the solutions become real. Of these two solutions we train the BDT
on the one that coincides better with the truth level neutrino.
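
The neutrino reconstruction described above can be sketched as follows. This is a minimal illustration and not the BoCA implementation: it assumes a massless lepton, an on-shell $W$ with an assumed mass of 80.4 GeV, and an arbitrary step size for moving the missing energy vector towards the lepton. With $\mu = m_W^2/2 + \vec{p}_T^{\,l} \cdot \vec{p}_T^{\,\nu}$, the on-shell condition gives $p_z^\nu = (\mu\, p_z^l \pm E_l \sqrt{\mu^2 - (p_T^l)^2 (p_T^\nu)^2}) / (p_T^l)^2$.

```python
import math

M_W = 80.4  # GeV; assumed W mass used for the on-shell constraint

def neutrino_pz(lepton, met, m_w=M_W, step=0.05, max_iter=1000):
    """Solve the W mass constraint for the neutrino longitudinal momentum.

    lepton = (px, py, pz) of a massless lepton with non-zero pT, met = (px, py).
    Returns the (possibly shifted) missing momentum and both pz solutions.
    """
    lx, ly, lz = lepton
    pt_l2 = lx * lx + ly * ly
    e_l = math.sqrt(pt_l2 + lz * lz)
    mx, my = met
    for _ in range(max_iter):
        mu = 0.5 * m_w ** 2 + lx * mx + ly * my
        discriminant = mu * mu - pt_l2 * (mx * mx + my * my)
        if discriminant >= 0.0:
            root = e_l * math.sqrt(discriminant)
            return (mx, my), [(mu * lz - root) / pt_l2, (mu * lz + root) / pt_l2]
        # Complex solutions: move the missing momentum towards the lepton.
        mx += step * (lx - mx)
        my += step * (ly - my)
    raise RuntimeError("no real solution found")

def w_mass(lepton, met, nu_pz):
    """Invariant mass of the massless lepton and neutrino."""
    lx, ly, lz = lepton
    e_l = math.sqrt(lx * lx + ly * ly + lz * lz)
    e_nu = math.sqrt(met[0] ** 2 + met[1] ** 2 + nu_pz ** 2)
    m2 = 2.0 * (e_l * e_nu - lx * met[0] - ly * met[1] - lz * nu_pz)
    return math.sqrt(max(m2, 0.0))

# Transverse mass below m_W: real solutions without any shift.
met_a, solutions_a = neutrino_pz((40.0, 10.0, 25.0), (-20.0, 30.0))
# Transverse mass above m_W: the missing momentum has to be moved first.
met_b, solutions_b = neutrino_pz((100.0, 0.0, 50.0), (-100.0, 0.0))
```

Both returned solutions reproduce the assumed $W$ mass. Choosing between them requires additional information, which in the training is provided by the truth level neutrino.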

4. Heavy Higgs reconstruction 

For the Higgs reconstruction we combine two tops, or one top and one bottom, for the
neutral and charged Higgs, respectively. We train the BDT on the same variables as
in the top reconstruction (cf. Step 3).

5. Bottom Fusion BDT 

Additionally we train a BDT on the bottom fusion pair accompanying the Higgs. The
most important discriminating variables in this case are the large rapidity difference
between these two jets and their $b$- (top-) likeliness. We demonstrate the performance
of this tagger on vector boson fusion samples in Figure 13a.

(a) Bottom fusion pair BDT. (b) $H/A\,b\bar{b}$ event BDT.

Figure 13: BDT distributions for the bottom fusion pair BDT (a) and the final event BDT for
associated production of neutral Higgs (b) at a 100 TeV $pp$ collider. The bottom fusion pair BDT (a)
is tested with $b\bar{b}$, $c\bar{c}$ and $jj$ vector boson fusion and Drell-Yan samples. The large overlap between
the bottom and the charm sample is due to the limited tracker coverage of $|\eta| < 3.5$. For the event
BDT (b) we show two examples of signal (solid) and background (dotted).


6. Event BDT

In the last step we combine the reconstructed heavy Higgs with the bottom fusion
pair and global event observables. The global variables contain

$H_T$ of the event

Jet number

Lepton number

Bottom BDT response of the remaining jets

Hardness of the remaining jets and leptons

We illustrate the response of the event BDT for the case of neutral Higgs in Figure 13b.
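
As an illustration of how the global inputs of the event BDT could be collected, consider the sketch below. The event format (lists of dictionaries with `pt` and `bottom_bdt` entries), the function name and the definition of $H_T$ as the scalar sum of all jet transverse momenta are assumptions made for this example and do not reflect the interface of our analysis code.

```python
def global_observables(jets, leptons, used_jets=()):
    """Collect event-level inputs for the final BDT.

    jets and leptons are lists of dicts with a 'pt' entry (GeV); jets may also
    carry a 'bottom_bdt' response. used_jets holds the indices of jets already
    assigned to the Higgs candidate or to the bottom-fusion pair.
    """
    remaining = [jet for i, jet in enumerate(jets) if i not in used_jets]
    return {
        "ht": sum(jet["pt"] for jet in jets),  # assumed: scalar sum of jet pT
        "jet_number": len(jets),
        "lepton_number": len(leptons),
        "remaining_bottom_bdt": [jet.get("bottom_bdt", 0.0) for jet in remaining],
        "remaining_jet_pt": sorted((jet["pt"] for jet in remaining), reverse=True),
        "lepton_pt": sorted((lep["pt"] for lep in leptons), reverse=True),
    }

# Hypothetical event: the hardest jet is already used by the Higgs candidate.
example_jets = [
    {"pt": 900.0, "bottom_bdt": 0.1},
    {"pt": 400.0, "bottom_bdt": 0.8},
    {"pt": 200.0, "bottom_bdt": -0.3},
]
example_leptons = [{"pt": 50.0}]
observables = global_observables(example_jets, example_leptons, used_jets={0})
```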

C Cut-based Analyses 

We analyze the processes $pp \to b\bar{b}H/A \to b\bar{b}\tau_h\tau_l$ and $pp \to btH^\pm \to bt\tau_h\nu$ at 100 TeV with
a cut-based approach.

C.1 Neutral Higgs

For the semi-leptonic channel $H/A \to \tau_h\tau_l$ with a large Higgs mass, the signal contains
one hard lepton and one hard jet, possibly tagged as a $\tau$-jet. We present the analysis for
$m_H/m_A = 3$ TeV for illustration. The cuts applied in our analysis are

Cut 1 One lepton with $p_T^l > 700$ GeV and veto on a second lepton with $p_T^l > 25$ GeV

Cut 2 Exactly one oppositely charged ($Q(\tau) = -Q(l)$) jet with $\tau$-tag and $p_T^\tau > 700$ GeV

Cut 3 Transverse mass $m_T(l, E_T^{\text{miss}}) < 160$ GeV

Cut 4 Exactly two $b$-tagged jets with $p_T > 40$ GeV

Cut 5 $\Delta\eta_{bb} > 2.5$

Here $m_T(l, E_T^{\text{miss}})$ is the transverse mass defined as

$$ m_T(l, E_T^{\text{miss}}) = \sqrt{2\, p_T^l\, E_T^{\text{miss}} \left(1 - \cos\phi_{l,\text{miss}}\right)} \,, \tag{C.1} $$

where $\phi_{l,\text{miss}}$ is the azimuthal angle between the lepton and the missing transverse momentum.

Process                                            N_gen   N_cut1  N_cut2  N_cut3  N_cut4  N_cut5  f_s
$b\bar{b}H \to b\bar{b}\tau\tau$                   10000   736     192     123     16      10      0.460
$b\bar{b}A \to b\bar{b}\tau\tau$                   10000   756     185     114     10      6       0.460
$b\bar{b}Z/\gamma^* \to b\bar{b}\tau\tau$          10000   403     64      31      11      2       0.088
$c\bar{c}Z/\gamma^* \to c\bar{c}\tau\tau$          10000   451     58      26      4       0       0.07
$t\bar{t} \to b\bar{b}\tau\tau\nu\nu$              7686    195     124     39      4       0       0.73
$t\bar{t} \to b\bar{b}l\tau\nu\nu$                 19043   5699    2899    470     91      7       1.16

Table 4: Cut flows for the $pp \to b\bar{b}H/A \to b\bar{b}\tau\tau$ signal and background with $m_H/m_A = 3$ TeV. The scale
factor $f_s$ is calculated for a luminosity of 3000 fb$^{-1}$ and a total cross-section $\sigma = 1.53$ fb, corresponding
to $\tan\beta = 10$. The signal and background samples are generated with $\tau$ decaying inclusively. During
the generation of the background sample, a pre-cut of $p_T > 700$ GeV is applied to both leptons and
the $\tau$-leptons.

The cut flows for signal and background are presented in Table 4. The resulting significance
for exclusion is 1.86$\sigma$ for a luminosity of 3000 fb$^{-1}$. The resulting exclusion region at a
100 TeV collider is shown in Figure 10.
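
The quoted significance can be reproduced from the last cut and the scale factors of Table 4 if one assumes the definition $S/\sqrt{S+B}$ for the exclusion significance. This definition is an assumption of the sketch below, chosen because it matches the quoted number; the dictionary keys are illustrative labels.

```python
import math

def exclusion_significance(signal, background):
    """Assumed definition: S / sqrt(S + B) with luminosity-scaled yields."""
    return signal / math.sqrt(signal + background)

# (events after Cut 5, scale factor f_s) as listed in Table 4
signal_samples = {
    "bbH -> bb tau tau": (10, 0.460),
    "bbA -> bb tau tau": (6, 0.460),
}
background_samples = {
    "bbZ/gamma* -> bb tau tau": (2, 0.088),
    "ccZ/gamma* -> cc tau tau": (0, 0.07),
    "tt -> bb tau tau nu nu": (0, 0.73),
    "tt -> bb l tau nu nu": (7, 1.16),
}

signal_yield = sum(n * f for n, f in signal_samples.values())
background_yield = sum(n * f for n, f in background_samples.values())
neutral_significance = exclusion_significance(signal_yield, background_yield)
```

With these inputs the expected yields are 7.36 signal and about 8.30 background events at 3000 fb$^{-1}$.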

C.2 Charged Higgs 

The kinematics of the channel $pp \to tbH^\pm \to tb\tau_h\nu$ with a large Higgs mass is characterized
by large missing energy, a hard $\tau_{\text{had}}$ and a large $m_T(\tau_{\text{had}}, E_T^{\text{miss}})$. These features suppress the
irreducible and the multi-jet backgrounds, particularly in the large Higgs mass domain.

We present the analysis for $m_{H^\pm} = 3$ TeV for illustration. The cuts applied in the analysis
are:

Cut 1 > 4 jets, with exactly two $b$-tagged, > 2.5, and veto of leptons with $p_T >$
25 GeV.

Cut 2 Exactly one hard ($p_T > 650$ GeV) $\tau$-tagged jet, $E_T^{\text{miss}} > 650$ GeV and transverse
mass $m_T(\tau, E_T^{\text{miss}}) > 1300$ GeV.
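
Cut 2 can be expressed compactly in code. The sketch below applies the transverse mass definition of Eq. (C.1) to the $\tau$-tagged jet instead of the lepton; the event representation (dictionaries with `pt` in GeV and `phi` in radians) and the function names are hypothetical.

```python
import math

def transverse_mass(pt, phi, met, met_phi):
    """Transverse mass of a visible object and the missing transverse energy."""
    return math.sqrt(2.0 * pt * met * (1.0 - math.cos(phi - met_phi)))

def passes_cut2(tau_jets, met, met_phi, pt_min=650.0, met_min=650.0, mt_min=1300.0):
    """Exactly one hard tau-tagged jet, large missing energy and large m_T."""
    hard = [tau for tau in tau_jets if tau["pt"] > pt_min]
    if len(hard) != 1 or met <= met_min:
        return False
    tau = hard[0]
    return transverse_mass(tau["pt"], tau["phi"], met, met_phi) > mt_min

# Back-to-back tau jet and missing energy: m_T = 2 * 700 GeV = 1400 GeV.
back_to_back = passes_cut2([{"pt": 700.0, "phi": 0.0}], 700.0, math.pi)
# Collinear configuration: m_T vanishes and the event is rejected.
collinear = passes_cut2([{"pt": 700.0, "phi": 0.0}], 700.0, 0.0)
```

The thresholds are those quoted for $m_{H^\pm} = 3$ TeV and would have to be adjusted for other mass hypotheses.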


















Process                                                  N_gen    N_cut1  N_cut2  f_s
Signal      $tbH^\pm \to tb\tau\nu$                      10000    491     118     0.152
Background  $t\bar{t} \to b\bar{b}\tau\tau\nu\nu$        176203   2334    20      1.51
Background  $t\bar{t} \to b\bar{b}l\tau\nu\nu$           598242   1874    8       1.95

Table 5: Cut flows for the $pp \to tbH^\pm \to tb\tau\nu$ signal and background with $m_{H^\pm} = 3$ TeV, where $f_s$ is
the scale factor for a luminosity of 3000 fb$^{-1}$ and a total cross-section $\sigma = 0.507$ fb, corresponding
to $\tan\beta = 10$. The signal and background samples are generated with $\tau$ decaying inclusively. For
$m_{H^\pm} = 3$ TeV, the irreducible background $t\bar{t} \to b\bar{b}jj\tau\nu$ and the multi-jet background are well suppressed.


We present the cut flows for signal and background in Table 5. The significance for exclusion
is 2.25$\sigma$ for a luminosity of 3000 fb$^{-1}$. The cuts on $E_T^{\text{miss}}$, $p_T(\tau_{\text{had}})$ and $m_T$ are modified
to optimize the analysis as $m_{H^\pm}$ varies. The resulting exclusion region at a 100 TeV
collider is shown in Figure 10.
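
The scale factors of Table 5 follow from the cross-sections and the generated sample sizes, assuming $f_s = \sigma L / N_{\text{gen}}$. The sketch below recomputes them from the signal cross-section quoted in the caption of Table 5 and from the 100 TeV background cross-sections of Table 3 for a missing energy pre-cut of 600 GeV, and then evaluates the significance under the assumed definition $S/\sqrt{S+B}$. Both relations are assumptions of this example that happen to reproduce the quoted numbers.

```python
import math

LUMINOSITY_FB = 3000.0  # integrated luminosity in fb^-1

def scale_factor(sigma_fb, n_generated, luminosity_fb=LUMINOSITY_FB):
    """Weight per generated event: expected events divided by generated events."""
    return sigma_fb * luminosity_fb / n_generated

# Cross-sections in fb; event numbers after Cut 2 from Table 5.
f_signal = scale_factor(0.507, 10000)
f_tautau = scale_factor(88.6, 176203)  # tt -> bb tau tau nu nu, 0.0886 pb
f_ltau = scale_factor(388.0, 598242)   # tt -> bb l tau nu nu, 0.388 pb

signal_yield = 118 * f_signal
background_yield = 20 * f_tautau + 8 * f_ltau
charged_significance = signal_yield / math.sqrt(signal_yield + background_yield)
```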